DC Express: shortest latency protocol for reading phase change memory over PCI Express

Authors

  • Dejan Vucinic
  • Qingbo Wang
  • Cyril Guyot
  • Robert Mateescu
  • Filip Blagojevic
  • Luiz Franca-Neto
  • Damien Le Moal
  • Trevor Bunker
  • Jian Xu
  • Steven Swanson
  • Zvonimir Bandic
Abstract

Phase Change Memory (PCM) presents an architectural challenge: writing to it is slow enough to make attaching it to a CPU’s main memory controller impractical, yet reading from it is so fast that using it in a peripheral storage device would leave much of its performance potential untapped at low command queue depths, throttled by the high latencies of the common peripheral buses and existing device protocols. Here we explore the limits of communication latency with a PCM-based storage device over PCI Express. We devised a communication protocol, dubbed DC Express, where the device continuously polls read command queues in host memory without waiting for host-driven initiation, and completion signals are eliminated in favor of a novel completion detection procedure that marks receive buffers in host memory with incomplete tags and monitors their disappearance. By eliminating superfluous PCI Express packets and context switches in this manner we are able to exceed 700,000 IOPS on small random reads at queue depth 1.
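The completion detection idea described above can be illustrated with a small host-side simulation. This is only a sketch under stated assumptions, not the authors' implementation: the chunk size `CHUNK`, the marker value `TAG`, and all function names are hypothetical, the device is simulated by a generator that writes chunks in arbitrary order, and the sketch ignores the case where real payload data happens to equal the tag.

```python
import random

CHUNK = 64                     # assumed size of one device write (hypothetical)
TAG = b"\xde\xad\xbe\xef"      # assumed "incomplete" marker value (hypothetical)


def mark_incomplete(buf):
    """Host side: stamp the tag at the start of every chunk before issuing a read."""
    for off in range(0, len(buf), CHUNK):
        buf[off:off + len(TAG)] = TAG


def is_complete(buf):
    """Host side: the read is done once no chunk still starts with the tag."""
    return all(buf[off:off + len(TAG)] != TAG
               for off in range(0, len(buf), CHUNK))


def device_writes(buf, data, rng):
    """Simulated device: deliver the chunks in arbitrary order, one per step."""
    offsets = list(range(0, len(data), CHUNK))
    rng.shuffle(offsets)
    for off in offsets:
        buf[off:off + CHUNK] = data[off:off + CHUNK]
        yield off


def read_block(data, seed=0):
    """Run one simulated read; return the received bytes and the poll count."""
    assert len(data) % CHUNK == 0, "sketch assumes whole chunks only"
    buf = bytearray(len(data))
    mark_incomplete(buf)
    polls = 0
    for _ in device_writes(buf, data, random.Random(seed)):
        polls += 1               # host polls the buffer after each arrival
        if is_complete(buf):     # no tags left: no separate completion signal needed
            break
    return bytes(buf), polls
```

In this sketch the host never receives a completion message; it learns that the read has finished purely by observing that every tag in the receive buffer has been overwritten, regardless of the order in which chunks arrive.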

Similar resources

Can Memory-Less Network Adapters Benefit Next-Generation InfiniBand

InfiniBand is emerging as a high-performance interconnect. It is gaining popularity because of its high performance and open standard. Recently, PCI-Express, which is the third generation high-performance I/O bus used to interconnect peripheral devices, has been released. The third generation of InfiniBand adapters allows applications to take advantage of PCI-Express. PCI-Express offers very low...

StarT-jr: A Parallel System from Commodity Technology

StarT-jr is an experimental parallel system composed of a network of personal computers (PCs). The system leverages the momentum of the microprocessor and PC industries to achieve excellent single node performance at a low cost. For parallel processing, StarT-jr uses the Flexible User-level Network Interface (FUNi) to provide low-overhead, user-level inter-processor communication over two IEEE High Per...

StarT-jr: A Parallel System from

StarT-jr is an experimental parallel system composed of a network of personal computers (PCs). The system leverages the momentum of the microprocessor and PC industries to achieve excellent single node performance at a low cost. For parallel processing, StarT-jr uses the Flexible User-level Network Interface (FUNi) to provide low-overhead, user-level inter-processor communication over two IEEE ...

SpaceWire-HS Host Adapter – An FPGA based PCI Express Device for Versatile High-Speed Channels

Robotic systems like the DLR Hand Arm System that feature control cycles beyond 1 kHz demand deterministic, low-latency communication. Therefore, DLR is working on high-speed SpaceWire. This paper presents the SpaceWire-HS host adapter, an FPGA-driven PCI Express device for high-speed SpaceWire. The adapter provides a generic host interface for QNX real-time hosts, supported by a client C++...

Spartan-6 FPGA Connectivity

This white paper discusses the observed performance of the ...

Publication date: 2014